Trends That Affect Temporal Analysis Using SourceForge Data
نویسندگان
چکیده
SourceForge is a valuable source of software artifact data for researchers who study project evolution and developer behavior. However, the data exhibit patterns that may bias temporal analyses. Most notable are cliff walls in project source code repository timelines, which indicate large commits that are out of character for the given project. These cliff walls often hide significant periods of development and developer collaboration—a threat to studies that rely on SourceForge repository data. We demonstrate how to identify these cliff walls, discuss reasons for their appearance, and propose preliminary measures for mitigating their effects in evolution-oriented studies.
منابع مشابه
Quantitative Analysis of Open Source Projects on SourceForge
Relatively easy accessibility of high volumes of information about open source software makes it an interesting target for quantitative analysis meant to discover some hidden properties and trends of this software development model. In this work we demonstrate how such information can be acquired from the largest open source hosting facility — SourceForge — with nearly minimal effort. We compar...
متن کاملThreats to Validity in Analysis of Language Fragmentation on SourceForge Data
Reaching general conclusions through analysis of SourceForge data is difficult and error prone. Several factors conspire to produce data that is sparse, biased, masked, and ambiguous. We explore these factors and the negative effect that they had on the results of “Impact of Programming Language Fragmentation on Developer Productivity: a SourceForge Empirical Study.” In addition, we question th...
متن کاملPrecipitation Trends Analysis in Southwest Asia during the Last Half Century
Precipitation is a climatic elements that have temporal - spatial distribution. In this research database of Global Precipitation Climatology Centre (GPCC) with a resolution 0.5×0.5 degree for 50 year is used, that was constituted with dimensions of 12800*600. Temporal data are on the columns and pixels (spatial data) located on the rows. The results show an increasing trend in spring and fall ...
متن کاملRevealing the impact of changing land use of the annual spatiotemporal boundary layer height (Kermanshah Case Study)
Introduction Atmospheric boundary layer (ABL), is the lowest part of the atmosphere. Its behavior is directly influenced by its contact with earth surface. On earth it usually responds to changes in surface radiative forcing in an hour or less. In this layer physical quantities such as flow velocity, temperature, moisture, etc., display rapid fluctuations (turbulence) and vertical mixing is st...
متن کاملProgramming Language Trends in Open Source Development: An Evaluation Using Data from All Production Phase SourceForge Projects
In this work, we analyze data collected from the CVS repositories of 9,997 Open Source projects hosted on SourceForge in an effort to understand trends in programming language usage in the Open Source community between 2000 and 2005. The trends we consider include: 1) the relative popularity of the ten most popular programming languages over time, 2) the use of multiple programming languages by...
متن کامل